@DavidePaglieri
Contributor

A common problem with BALROG is that models with weaker instruction-following ability struggle to output the action in the format required by the naive.py agent.

We add a new, more robust naive zero-shot agent that is prompted with the following:

You must choose exactly one of the listed actions and output it strictly in the following format:
<|ACTION|>YOUR_CHOSEN_ACTION</|ACTION|>
You must not output any other text before or after these tags. No explanation, no reasoning, just the action within these tags.
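
For illustration, here is a minimal sketch of how a completion in this format could be parsed. It assumes a regular-expression match on the tags. The helper name `extract_action` and the choice to return `None` when no tags are found are hypothetical and not taken from the merged code.

```python
import re
from typing import Optional

# Matches the tag format given in the prompt. The pipes are escaped because
# "|" is the alternation operator in regular expressions.
ACTION_PATTERN = re.compile(r"<\|ACTION\|>(.*?)</\|ACTION\|>", re.DOTALL)


def extract_action(completion: str) -> Optional[str]:
    """Return the text inside the first pair of action tags, or None.

    Hypothetical helper: the real agent may handle missing tags differently.
    """
    match = ACTION_PATTERN.search(completion)
    if match is None:
        return None
    # Strip whitespace so "<|ACTION|> north </|ACTION|>" yields "north".
    return match.group(1).strip()
```

Searching for the tags anywhere in the completion, rather than requiring the whole string to match, would let the action be recovered even when a model adds text around the tags despite the instruction.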

@DavidePaglieri DavidePaglieri merged commit 9c65dad into main Dec 17, 2024
4 checks passed
@DavidePaglieri DavidePaglieri deleted the feat/robust_naive branch December 18, 2024 15:44
2 participants